Abstract. Recently Wolff [28] obtained a sharp L 2 bilinear restriction the- 
orem for bounded subsets of the cone in general dimension. Here we adapt 
the argument of Wolff to also handle subsets of "elliptic surfaces" such as 
paraboloids. Except for an endpoint, this answers a conjecture of Machedon 
and Klainerman, and also improves upon the known restriction theory for the 
paraboloid and sphere. 
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1. Introduction 

Let n > 1 be a fixed integer, and let S be a smooth compact hypersurface with 
boundary in the space R x R n := {(r, £) : r S R, £ G R n }, which we shall interpret 
as the spacetime frequency space. If < p,q < oo, we say that the linear adjoint 
restriction estimate R* s (p — > q) holds if one has an estimate of the form 

(!) H/Hli^RxR") - C P,q,s\\f\\LP(S,da) 

for all test functions / on S, where 

F{t,x) := [ F{T^)e 2m(tT+x<) 



is the spacetime Fourier transform. The restriction problem 1 for S is to determine 
for which p, q the estimate R<j(p — > q) holds. This problem was posed by Stein 
[16], and is related to other outstanding problems in harmonic analysis such as the 
Bochner-Riesz conjecture, local smoothing conjecture, and Kakeya conjecture; see 
e.g. [5], [27], [19] for further discussion. In one spatial dimension n = 1, the problem 
is mostly solved, but in two and higher spatial dimensions the problem remains far 
from settled (except in special cases such as p = 2) , despite much recent progress. 

It has been known for several decades that one can attack this conjecture in the 
special case q — 4 by squaring both sides of the linear estimate (1) and studying 
the resulting bilinear L 2 estimate; see e.g. [7], [15], etc. Variants of this idea have 
also been very useful for nonlinear dispersive equations, see e.g. [1], [9], etc. More 
recently, the same idea has been applied to more general values of q, see [4], [22], 
[23] , [24] , [28] , [29] . More precisely, for any two smooth compact hypersurfaces Si , 
S 2 with boundary in R x R" , with Lebesgue measure do\ and da 2 respectively, we 
say that the bilinear adjoint restriction estimate Rg Sa (2 x 2 — > q) holds if one has 

\\fida 1 f 2 da 2 \\ Lq( ji x ji"- ) < Cg^^WfiW^iSud^WMlL^is^^i)- 



1991 Mathematics Subject Classification. 42B15, 35Q55. 

1 Historically, the restriction problem asks for which exponents q' is it true that the Fourier 
transform of an L q (R X R n ) function can be meaningfully restricted to S. This is essentially the 
adjoint of the above problem; see [17] for further discussion. 
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for all test functions fi, f 2 supported on Si, S2 respectively 2 . 

The linear and bilinear estimates are closely related; for instance when Si = 
S 2 = S, then R s S2 (2 x 2 — > q) is clearly equivalent to R* s (2 — > 2g). However, 
it was observed in [4], [22] that further estimates are available if Si and S 2 are 
not equal, and in particular if they satisfy some sort of transversality condition. 
For instance, if the normals of Si and of S2 are separated by at least some fixed 
angle c > 0, then one can easily obtain the bilinear estimate R s S; (2 x 2 — > 
2) by PlanchercPs inequality and Cauchy-Schwarz, even in cases where the linear 
estimates Rg (2 — > 4), Rg 2 (2 — > 4) fail. Furthermore, these bilinear restriction 
estimates can then be used (via some rescaling and interpolation arguments) to 
obtain new linear restriction estimates; see [22], [23], [28] for some examples of this. 

Two important examples of surfaces Si, S2 are: (a) compact, transverse subsets 
of the light cone 

{(T,£)eRxR":|r| = |£|}; 
and (b) compact, transverse (i.e. disjoint) subsets of the paraboloid 3 

(2) S:={(r,OeRxR":r = -i|e| 2 }- 

Apart from being model examples for the bilinear restriction problem, they also 
have direct application to nonlinear wave and Schrodingcr equations respectively. 
In 1997, Machedon and Klainerman observed that in these two cases, the estimate 
R* Si S2 (2 x 2 — > q) can only hold when q > , and conjectured that this necessary 
condition was in fact sufficient (see [22] , [28] for further discussion) . The paraboloid 
(2) also serves as models for other surfaces with non-vanishing Gaussian curvature, 
such as the sphere; see the remarks section for further discussion. 

Of the two cases (a) and (b), the cone problem was generally thought to the more 
difficult of the two (compare for instance [4] with [3]). It was thus a surprise when 
Wolff [28] established the Machedon-Klainerman conjecture for the cone in all non- 
endpoint cases q > (with the endpoint case being attained shortly afterward 
in [21]). A key geometrical observation was that if one took the union of all the 
lines through a fixed origin xq which were normal to S2, then any line normal to 
Si could only intersect this union in at most one point; this is ultimately due to 
the single vanishing principal curvature on the cone, which forces all of the above 
lines to be light rays. The analogous statement for the paraboloid however is false, 
so one cannot directly apply Wolff's argument to case (b). Even in two spatial 
dimensions n = 2, the Machedon-Klainerman conjecture had only been verified in 
this case for q > 2 — ^ (sec [23]), instead of the conjectured q > 2 — |. 

In this paper we adapt Wolff's argument in [28] to overcome this geometric 
obstruction: 

Theorem 1.1. Let Si and S2 be any disjoint compact subsets of the paraboloid S 
defined in (2). Then we have R* Si g 2 (2 x 2 — ► q) for any q > ^pf • In particular, 
the Machedon-Klainerman conjecture is true up to endpoints for the paraboloid. 



2 One can of course place /1 and /2 in Lebesgue spaces other than L 2 (see e.g. [22]), but we 
shall not need to do so within this paper. 

3 The choice of normalization factor — i may appear odd, but this is to ensure that waves of 
frequency £ travel at group velocity £. 
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By the general theory of linear and bilinear restriction theorems (see [22] , [23] ) , 
Theorem 1.1 implies some new progress on the restriction conjecture for paraboloids 
[16]. This conjecture asserts that R*(p — > q) holds 4 whenever q = ^^p' and 
q > 2(n + l)/n, where l/p+ 1/p' = 1; these conditions are known to be necessary. 

Corollary 1.2. TTie restriction conjecture for paraboloids is true for q > 2(n + 
3)/(n+l). 

Proof This follows directly from Theorem 1.1 in this paper and Theorem 2.2 in 
[22] , together with the observation that one can freely raise the exponents p, q in 
the estimate R Si S2 (p x p — > q). See [22], [23] for more examples of this type of 
argument. ■ 

In [18] this conjecture was verified for q > 2(n + 2)/n; in the special case n = 2, 
the best known previous result was q > 4 — ^ (see [23]); the above Corollary 
improves this to q > 4 — |. (It is conjectured that this bound holds in fact for all 
9 > 3). 

A similar result holds for all other positively curved surfaces, such as the sphere; 
we discuss this in Section 9. 

Functions of the form u := fda, where da is surface measure on S, can easily be 
seen to solve the free Schrddinger equation 

(3) Amu t - Au = 0. 

The factor An is an artifact of our conventions and should be ignored. We shall 
call solutions to (3) free Schrddinger waves. For any free Schrodinger wave, the 
quantity ll u ( i )ll^ 2 (P li ") i s an invariant of time, and shall be referred to as the total 
probability P(u) of the wave 5 . Observe that 



(4) /'i.M'V / 



ill 2 



for any compact subset Sj of S. 

Corollary 1.3. Let N > 0, and let u\, u 2 be two solutions to the Schrddinger 
equation (3), such that Uj(t) has Fourier transform supported in the region < N 
for j = 1,2. Suppose also that the Fourier supports ofuj(t) are separated by at least 
> cN. Then for any q > we have the spacetime estimate 



\UlU 2 \\ L i 



Proof By scale invariance one can take N — 1. The claim then follows directly 
from Theorem 1.1 and (4). ■ 

Such a statement implies various bilinear estimates for X s ' b norms for Schrodinger 
and wave equations, see e.g. [21] for a discussion. It is also likely that this sort 
of estimate has application to nonlinear Schrodinger equations; for instance, one 
can combine this estimate with the arguments in [14] to obtain new well-posedness 
results for certain non-linear Schrodinger equations in Besov spaces. 



4 The numerology is shifted by one from that in [16] because we are working in R X R n instead 
ofR n . 

5 This quantity plays the role of the energy for solutions to the wave equation, see [21]. 
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Another application to Schrodingcr equations was noted in [24]. Indeed, from 
Theorem 2.1 in [24] and Theorem 1.1 of this paper we see immediately that H s 
solutions to (3) converge pointwise to the initial data as t — > for n = 2 and 
s > 2/5; this improves upon the result of s > 15/32 given in that paper, but does 
not reach the conjectured level of s > 1/4. In higher dimensions n > 2, a direct 
modification of the arguments in [24] gives convergence for s > n/(n + 3). 

The author is a Clay Prize Fellow and is supported by the Packard Foundation. 
The author also thanks Fabrice Planchon for helpful comments, and the anonymous 
referee for careful reading of the paper and many cogent suggestions (which have 
since been incorporated into the paper). 

2. Notation 

If X is a finite set, we use j^X to denote its cardinality; if X is a measurable 
set, we use \X\ to denote its Lebesgue measure. 

If (t,x) € R x R™ is a point in spacetime, we use B((t,x),r) to denote the 
spacetime ball 

B((t,x),r) := {(t',x r ) e R x R" : \(t' , x') - (t, x)\ < r} 

and D(x,r) to denote the spatial disk 

D(x, r) := {x' G R" : \x' - x\ < r}. 

We use A < B or A = O(B) to denote the estimate \A\ < CB, where C is a 
constant depending only on n. 

Very shortly, our estimates shall involve a large parameter R 3> 1. We shall use 
AS B to denote the estimate A < C £ R £ B for all e > 0; in particular we note that 
(logi?) c < 1 for any C. 

3. Reduction to localized restriction estimates 

We now begin the proof of Theorem 1.1. Our arguments closely follow that of 
Wolff [28] , but with one additional twist near the end. The argument is organized as 
follows. In this section we make a preliminary reduction to the problem of obtaining 
sufficiently good localized restriction estimates, and then set up the induction ar- 
gument we will use to obtain such estimates. In the next section we recall the wave 
packet decomposition of Schrodingcr waves, which has been fundamental to all of 
the recent developments in restriction theory for these waves. Then, in Section 5, 
we use the inductive hypothesis to strip away a certain "localized" component of 
the estimate, and reduce ourselves to considering only the "global" portion. To 
estimate this global expression we perform a standard fine scale decomposition of 
space in Section 6, splitting the problem into obtaining a fine-scale estimate and 
then a coarse-scale estimate. The fine-scale estimate is purely Fourier-analytic and 
is estimated using PlancherePs theorem in Section 7; our innovation here is to ex- 
ploit an additional constraint on frequencies arising from the codimension 1 nature 
of the frequency space hypersurface S. The coarse-scale estimate is a geometric 
combinatorics estimate of Kakeya type, and is proven by the standard Bourgain- 
Wolff "bush" counting argument; the key point is that the constraint on frequencies 
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from the fine-scale analysis translates 6 to a constraint on directions in the coarse- 
scale estimate, thus restricting the bush to a hypersurface. This puts us in the 
situation to apply Wolff's counting argument from [28], which then concludes the 
proof. 

We now turn to the details. Fix Si, S2. By a finite partition of Si and S2, 
exploiting the compactness hypothesis, we may assume that diam(Si), diam(S2) <C 
dist(Si, S2). After a suitable rotation, scaling, and Gallilean transformation (the 
latter effectively translates £ by an arbitrary amount while keeping S invariant), 
one may thus assume that 

Si:= { (r,OeS:^-ei\<^- n } 

and 

where ei is a standard unit basis vector. We shall also need the slight enlargements 
Si : ={(T,£)€S:|£-ei|<_L} 

and 

S2:=«T,0eS<±}, 

Following Wolff [28], our first step is to reduce matters to proving a localized 
restriction estimate in which we are permitted to lose epsilon powers of the local- 
ization scale R. 

Definition 3.1. We use R*s lX s 2 (^ x 2 — > q,a) to denote the estimate 

\\fldaif2da 2 \\Li(B((t ,x ),R) < C q ^ u s 2 , a R a \\fl\\L^(S 1 .da 1 )\\f2\\L^(S 2 Ma 1 ) 

for all smooth fi, f 2 on Si, S 2 , all R > 1, and all spacetime balls B((t ,x a ),R) of 
radius R. 

To prove Theorem 1.1, it suffices by standard "epsilon- removal" lemmas 7 to 
prove the local estimate 

3 

n+T 
for all a > 0. 

To prove (5) we use Wolff's induction on scale argument. It is easy to see that 
the above estimate must be true for sufficiently large a; for instance, one can use 
the crude bound ||/jrfcj||oo < C 1 1 / j' 1 1 2 to obtain (5) for some large a. The claim 
will then follow (as in Wolff [28]) from the following inductive statement. 



(5) 4 lxS2 (2x2^, Q ). 



6 The linkage between fine scales and coarse scales is provided (heuristically, at least) by the 
dispersion relation, which asserts that the frequency of a wave at fine scales determines the (group) 
velocity of that wave at coarse scales; in the physical interpretation of the Schrodingcr equation, 
this relation is codified by de Broglie's law p = tt^. To make this heuristic mathematically rigorous, 
the wave packet decomposition is an ideal tool. 

7 For instance, one can apply Lemma 2.4 from [23]; see also Section 4 of [4], or Section 8 of [10]. 
In all of these arguments (which are of Tomas-Stein type) the key fact is that the surface measures 
on Si and 52 has a Fourier transform which decays at infinity; this is ultimately a consequence 
of the non-vanishing curvature of these surfaces. 
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Proposition 3.2. Suppose a > is such that (5) holds. Then we have 

n 4- 3 

R} 1XS2 (2 x 2 -> -X_ max((l - J)a, C<5) + Ce) 

for all < (5, e <C 1, where the constants C are independent of 5 and e. 

By choosing <5 and e suitably we may make max((l — 5)a, CS) + Ce equal to 
a — co? for some small absolute constant c. Iterating this we thus see that the 
infimum of all a > for which (5) holds is zero, and the claim follows. 

It remains to prove Proposition 3.2. This will occupy the rest of the paper. 

4. The wave packet decomposition 

As in the arguments 8 of Bourgain [3], [4], Wolff [28], and others, the next step 
is to decompose the functions f\do\ and f2da2 into wave packets concentrated on 
R x tubes. 

Fix R ^> 1 (the case R ~ 1 being trivial), and let j = 1,2. Let Z" be the 
standard integer lattice in R™. We shall need a spatial grid X := i? 1 / 2 Z™ and a 
velocity grid V := R~ 1/2 Z n . We let Vj be those velocities 9 v e V such that (l,v) 
is normal to Sj, i.e. (— ^|u| 2 ,w) € Sj. 

We shall work on the spacetime slab [R/2,R] x R™. We define a Sj-tube to be 
any set of the form 

T := {{t, x):R/2<t< R; \x - (x(T) + tv(T))\ < R 1/2 }, 

where x(T) e X is the initial position of T and v(T) e Vj is the velocity. 

We shall need the following standard wave packet decomposition (this is the 
parabola analogue of the cone decompositions in [28], [21], [10], and is also implicit 
in [3], [22]): 

Lemma 4.1. Let j = 1, 2, and let fj be a smooth function on Sj. Then there exists 
a decomposition 

(6) f,da, ^cio, 

To 

where Tj ranges over all Sj -tubes, the complex-valued co-efficients ct 5 obey the l 2 
bound 

(7) (EKi 2 ) 1/2 ^n/iii^ 

and for each Tj , the wave packets 4>Tj are free Schrodinger waves, where for each 
R/2 < t < R, the function <^Tj(i) has Fourier transform supported on the set 

(8) U G R n ■ £ = v(Tj) + OiR- 1 ' 2 )} 

8 The basic idea of using wave packet decompositions to attack restriction and Bochncr-Ricsz 
type problems goes back to Fefferman and Cordoba. 

9 Note that because of our normalization of the paraboloid (2), the group velocity v is exactly 
equal to the spatial frequency £; physically, this is just de Broglie's relation mv = p = under 
the normalization m = H = 1. Thus we will not bother to make much of a distinction between 
velocity and frequency in this argument. 
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(informally, 4>Tj has frequency v(Tj) + 0(R 1 ^ 2 )) and obeys the pointwise estimates 

(9) Ifc., (t, x)\ < C N RT»'\l + 

/or oiZ x € J?™, and any iV > 0. In particular, outside of the tube 

R S Tj := {(t, x):R/2<t< R; \x - {x(T 3 ) + tv(Tj))\ < R 1/2+s }, 
we have the estimate 

(10) |<M*>*)I js^ 100 "- 

Finally, any collection Tj of Sj -tubes, we have the probability estimate 

(11) P( E ^■)<#T j . 

Proof We first prove the Lemma under the assumption that fj is supported on a 
cap of the form 

(12) {(T,a£S j: ^HOrt 

for some fixed u€ V,-; this assumption will be removed at the end of this proof. 

From the Poisson summation formula we may find a Schwartz function r\ whose 
Fourier transform is supported in a disk -D(0, C) C R™ such that X^eZ" v{ x ^h) = 

1. Let F(x) := fjdo-j(0,x) denote the initial data of fjdaj. We thus have the 
decomposition 

x Q £Xj 

Observe that the spatial Fourier transform of r]{ x ~ 1 ^° )F{x) is supported on a disk 
{£ € R" : £ = v + 0(i? -1 / 2 )}. Thus if we let u Xo be the unique Schrodingcr wave 
with initial data u Xo (0,x) := n ( ^ ^ 2 ° ) fj da j ( , x ) , then we have the decomposition 

fjdaj{t,x) = E w *o 

Now let Tj be a 5,-tube with u(Tj) = u. We write c Tj := R n / i MF(x(T j )), where 

MF(x) :=sup — 1— / |F| 
r>o |£>(a;,r)| i fl(xT) 

is the Hardy-Littlewood maximal function of F, and write 0^ := u x ^ T .- ) /cT j - Thus 
we have 

/..' /rT ,< E CT ^ T i' 

thus giving a decomposition (6) (setting ctv = <j)Tj — for v(Tj)^=v). Since F has 
Fourier transform supported in the disk (8), it enjoys a reproducing formula of the 
form F = F * tp where the reproducing kernel ip — ip v has Fourier support in a 
(slight enlargement of) the disk (8), and obeys the pointwise bounds 

mx)\ <C N R- n ' 2 {l + \x\/R 1 l 2 )- N 



8 



TERENCE TAO 



for any N > 0. From this it is easy to see that MF{x) ~ MF(x') whenever 
\x-x'\ < R 1 / 2 . Thus 



E M% / \MF(x)\ 2 dx<\\Ff 2 <\\f : 



3 112 



by the Hardy-Littlewood maximal inequality and PlancherePs theorem; this gives 
(7). 

By construction, the Fourier transform of fa (0) (and hence fa j (t) for any t) is 
supported in the set (8). Now we prove (9). By construction, it suffices to show 
the pointwise estimate 

(13) \u X0 (t, x)\ < CW(1 + lx - { *Z + tv)l )- N MF(x ) 

for all xo G X and t ~ R. By translation invariance we may take xq = 0. 

There are several ways to prove this estimate; for instance, one can observe that 
(12) is contained in an 0{R- r ) x 0{R-^ 2 ) disk with normal (l,v) and use some 
form of the uncertainty principle. Another way to argue is as follows. From the 
fundamental solution of the free Schrodinger equation we have an integral repre- 
sentation of the form 

U^X)^^' 2 j e ^|x- y |V^ ( ^_ )F( y) d y. 

Recall the reproducing formula uo — u * ip v . Thus we have 

«o(t, Z) - Ct- n / 2 J K v (x- y)r,(Jfa)F(y) dy 
where K v is the kernel 



K v (x) := J J e^W^Mv) dy. 



A routine stationary phase computation 10 , using the decay and Fourier support 
properties of cf) v , gives the bounds 

\K v {x)\<C N R- n ' 2 {l + ^-^)- N 

for all N > 0. The claim then follows (13) from a direct computation. 

The estimate (10) follows from (9), so it remains to prove (11). Since the prob- 
ability is time-invariant, it suffices to show that 

J I E <M0,z)| 2 dx<#Tj. 

T^Ty.viT^v 

But this follows directly from (9), since the tubes Tj with fixed velocity v(Tj) = v 
all have distinct initial positions x(Tj), which are separated by > R}l 2 . 



10 The reader may wish to simplify the calculation by first taking advantage of Gallilean invari- 
ance to reduce to the case v = 0, and then using the scale invariance of the Schrodinger equation 
to reduce to the case _R = 1. 
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Now we remove the hypothesis that fj was supported in a cap (12). For general 
fj, we may of course decompose fj = ^ veV . fj.v, where each fj, v is supported in 
the cap (12) associated to v, and we have the L 2 bound 



J 112- 



(14) £ \\kv\\l ~ II/: 

vev 3 

One can then apply the previous arguments to fj iV , obtaining a decomposition 

fi.v = X CT ^ T i 

T 3 :v(T 3 )=v 

obeying all the above properties. Summing over all v we obtain a decomposition 
(6) of fj, which then obeys (7) thanks to (14). The properties (9), (10), and the 
Fourier support in (8) have all been proven, so it remains to show (11). But we 
have already proven the special case 

P{ E <j> Tj )<#{Tj€Tj:v(Tj) = v} 
T,eT>(T 3 )=« 

for all v £ Vj ; the claim then follows by summing in v and exploiting the frequency 
space orthogonality (via the support property (8)). ■ 

We can now begin the proof of Proposition 3.2 in earnest. Fix a, and let Qr 
denote the cylinder 

Q R := {(t,x) : R/2 < t < R; \x\ < R}. 
It will suffice to prove the estimate 

\\lXhd^\\ T ^ (n ^{R [1 - S)a + R cs )\\hh\\hh 

(Qr) 

for all smooth f\, fi on Si, S*2, since any ball of radius R can be covered by O(l) 
translates of Qr. Here and in the sequel our implicit constants in or < arc 
allowed to depend on S. 

Fix /i, fi; we may normalize H/1II2 = II/2II2 = 1- We apply Lemma 4.1 to both 
/1 and / 2 , writing 

for j = 1, 2, where Tj ranges over S^-tubes. It thus suffices to show that 



We first remove some minor portions of this sum. Let us first consider the contri- 
bution when T\ and T2 are both disjoint from B(0, CR). In this case the bound (9) 
gives bounds of O(R~ 100n ) for both <j>r 1 and 4>t 2 i with the bound improving even 
more as Ti and T 2 move away from B(0,CR). Since the coefficients ct 15 ct 2 are 
bounded by (7), the contribution of this case is easily seen to be acceptable. 

A similar argument disposes of the case where T± is disjoint from B(0, CR) and 
T 2 intersects B(0,CR), as in this case there are only 0(R 2n ) possible values of T 2 . 
Similarly when T 2 is disjoint from B(0,CR) and Ti intersects B(0,CR). Thus we 
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may henceforth restrict ourselves to tubes which intersect B(0, CR). In particular, 
the number of tubes 7\ under consideration is now only 0(R 2n ), and similarly for 
T 2 . 

We can now eliminate the contribution of the terms where ct x = O(R~ 100n ) 
or ct 2 = O(R~ W0n ), since those terms can be easily controlled just by using L°° 
bounds on <f>r 1 , 4>t 2 (from e.g. (9)). Thus we only need to restrict ourselves to the 
tubes T\ where R~ 100n < ct x < 1, and similarly for T%. 

By pigeonholing the interval [R~ 100n , 1] dyadically into 0(logR) groups, and 
noting that logi? w 1, we may thus restrict the Ti summation to the tubes where 
CTi ~ 7i for some fixed /J^ 100 " < 71 < 1. Let Ti denote the set of all tubes T\ of 
this form; from (7) we have (#T i ) 1 / 2 < 7^ . We may as well assume that ct x = 71 
for these tubes Ti 6 Ti, since we can absorb the factor c^/ji harmlessly into <j>r 1 . 
Similarly, we may restrict the tubes T2 to a collection T2 with (#T2) 1 ^ 2 < 72 -1 
and ct 2 = 72 for all T2 € T2, for some R~ 1QOn < 72 < 1. It thus suffices to prove 

Proposition 4.2. We have the estimate 

( 15 ) II E E <hM T n±s, n £(i? (1 - 5)Q + J R c5 )(#r 1 ) 1/2 (#r 2 ) 1 / 2 

Tie Ti r 2 e T 2 ^ (Qr) 

/or all collections T\, T 2 0/ S\-tubes and S2-tubes respectively, such that all the 
tubes intersect B(0, CR). 

It remains to prove this Proposition. This will be done in the next few sections. 

5. Localization of tubes, and the inductive argument 

We now utilize the inductive hypothesis (5). The idea (due to Wolff [28]) is to 
give each wave packet 0Ti an d 4>t 2 a slightly smaller ball of radius R 1 ^ 5 which 
it can "exclude" via the inductive hypothesis; it will then suffice to verify the LP 
estimate on the exterior of these balls. This is similar to the "two-ends" reduction 
used in the Kakeya problem, see e.g. [26]. 

We turn to the details. We may cover the cylinder Qr by about 0(R CS ) finitely 
overlapping spacctimc balls B of radius i? 1-5 ; let B denote the collection of such 
balls. We can thus estimate the left-hand side of (15) extremely crudely 11 by 

(16) En E E ^M\ L m (B) - 

see TieTi T 2 eT 2 

Suppose we have some relation ~ between the tubes in Ti UT2 and balls in £>; we 
will specify this relation much later in the argument, but roughly we will associate 
T ~ B if the contribution of 4>t to the bilinear expression StigTi St 2 gT 2 < t ) T 1 4>T 2 
is "concentrated" in B. We can then estimate (16) by the "local part" 

( 17 ) Eik E M( E ^\\ L m (B) 

see TigTVti-b T 2 eT 2 :T 2 ^s 



11 Clearly we may improve on this by replacing the I 1 summation over balls B with an 
summation. This refinement is exploited in the endpoint theory, see [21], but is 
unnecessary for the non-endpoint case. 
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and the "global part" 

( 18 ) En E ^M L m ( B) - 

Bets T 1 eT 1 ,T 2 eT 2 -.T l7 tB or t 2 /b 

Consider the contribution of the local portion (17). From the probability estimate 
(11) we see that for each B € B and j = 1,2, Et £ T t * s a ^ ree Schrodinger 

wave with probability 

' 'I' - i; 

By applying the induction hypothesis (5), we may thus bound (17) by 

(17) < R (1 ~ S)a {#{Ti G Ti : Tj ~ B})V2(#{T 2 eT 2 :T 2 ~ 5}) 1 / 2 , 
see 

which by Cauchy-Schwarz becomes 

(17) <R^HY. E !) 1/2 (E E !) 1/2 - 

BGB TisTi:Ti~B SeB T 2 eT 2 :T 2 ~B 

Thus, if we make 

Assumption 5.1. For a// T e I\ U T 2 , we have 

(19) #{BeB:T~B}<l, 
then we can bound (17) by 

(17) < i?( 1 - 4 > a (#Ti) 1 / 2 (#T 2 ) 1 / 2 

which is acceptable. 

Roughly speaking, Assumption 5.1 asserts that each tube T <G Ti UT 2 is allowed 
to exclude 1 balls B from the summation in (18). It is thus natural to select 
~ so that each tube T excludes the ball B in which its "contribution" to (15) is 
"greatest"; this will become clearer when we define ~ in Section 8. 

It remains to estimate (18). It will suffice to show that 

(20) ii ]r ^ T2 \\ LmiB) <R cs mi) 1/2 m 2 )^ 

for all B £ B, since the claim then follows by summing in B. Note that a no longer 
plays any role; we will not need the induction hypothesis (5) in the remainder of 
the argument. Also, we can now freely lose powers of R s in what follows. 

Fix B; it remains to prove (20). By the triangle inequality, it will suffice to prove 
that 

(21) ii ]r ]T ^T 2 \\ L ^ (B) ^R C5 mi) 1/2 m2) l/2 

TieTi:Ti/B T 2 gT 2 

and 

(22) || Yl E 0T 1 0T 2 || L ^ f(B) ^i? C5 (#T 1 ) 1 / 2 (#T 2 ) 1 /2. 

T 2 eT 2 :T 2 /BTiGTi:Ti~B 
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The two claims are proven similarly (the expression (22) is slightly smaller, but 
the extra constraint T\ <~ B turns out to play no significant role), and so we will 
content ourselves with proving (21). (The definition of the equivalence relation <~ 
will be symmetric with respect to Ti and T 2 ). 

n + 3 

We follow Wolff's strategy of obtaining the bilinear L"Tt estimate by interpolat- 
ing between bilinear L 1 and L 2 estimates. The bilinear L 1 estimate follows easily 
from linear L 2 estimates: 

Lemma 5.2. We have 

II E E ^ 1 ^T 2 |U 1 (B)^ J R(#T 1 ) 1 / 2 (#r 2 ) 1 /2. 

T 1 eT 1 :T 1 ^BT 2 eT 2 

Proof By Holder's inequality it suffices to show that 

|| E 0T 1 || ia ( B )£fl 1/2 (#T 1 ) 1 /2 
TieTVTi/B 

and 

II E <l>T 2 \\Li(B)£R 1/2 (#T2) 1/2 . 

T 2 eT 2 

But these follow directly from (11) and an integration in time (since B is contained 
in the slab [-R, R] x R"). ■ 

From Lemma 5.2 and Holder's inequality (or the log-convexity of LP norms), it 
will suffice to prove the L 2 estimate 

(23) ii e E MTjL^^R^R-^^m^ 2 ^) 1 / 2 ; 

TieTi:Ti/BT 2 eT 2 

note how this uses the choice of exponent . 

The exponent i?~(™ -1 )/ 4 is best possible. To see this, let 7r denote the spacetime 
disk 

7T := {(t,x\ei) : t,x\ — O(R)}, 

and consider the example when Ti consists of the 0{sfR) tubes with velocity e\ 
which intersect n, while T 2 similarly consists of the 0(>/R) tubes with velocity —e\ 
which also intersect the 7r plane. By (9), the left hand side is essentially of magni- 
tude 0(R- n / 2 ) on a 0(\/i?)-neighbourhood tt (which thus has volume i?( n+3 )/ 2 ), 
and the numerology of (23) follows. (This is of course the same counterexample 
which shows that the exponent is best possible; see [22], [28]). 

6. Fine-scale decomposition 

In the previous part of the argument, we have decomposed the cylinder Qr 
(which is essentially a spacetime ball of radius R) into slightly smaller balls B of 
radius i? 1_<5 in order to utilize the induction hypothesis. To continue the argument 
we must decompose B into much smaller balls, namely balls of radius \/R, to fully 
exploit the spatial localization of the tubes T. Specifically, we cover (a slight dilate 
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of) Qr by a finitely overlapping collection q of balls of radius \/R~. Squaring (23), 
it thus suffices to show that 

( 24 ) E II E E ^T 1 0T 2 || 2 L2(9) ^i? C5 i?- ( - 1)/2 (#T 1 )(#T 2 ). 
qeq-. q c2B Tl eT 1 :T 17 0BT 2 eT 2 

First consider the contribution to (24) of the case where Ti n R s q = 0. In this 
case, it is easy to see from (10) and the triangle inequality that this contribution 
is certainly acceptable. Thus we only need to consider the terms in (24) where T\ 
intersects R 5 q. Similarly we only need to consider the terms where T 2 intersects 
R 5 q. 

It remains to show 

( 25 ) E II E E 0T 1 0T 2 ||i 2(9) £i? C5 i?- ( - 1)/2 (#T 1 )(#T 2 ) 
q eq:qc2B Tl e Tf B ( 9 ) t 2 eT 2 ( 9 ) 

where 

T, (<Z) := {Tj e T, : Tj n R 5 q^%} for j = 1, 2 
Tf B (q) := {Ti e Ti(g) : Ti ^ B}. 

We now do some dyadic pigeonholing, first on the multiplicity of the tubes Ti, 
T 2 through q, and then on the multiplicity of the balls q within T\. For any dyadic 
numbers 12 1 < £ i? 100 ", let q(/ii,/i 2 ) C q denote the set 

q(Mi,M2) :={?£q: Hj < #Tj(q) < 2(j,j for j = 1, 2}, 

thus the q(^i,^ 2 ) cover all the balls q e q for which the summand in (25) is non- 
zero. Since there are only O(log-R) 2 w 1 possible values of (fi\,fi2), it thus suffices 
to show that 

E 11 E E 0T 1 0T 2 n| 2(9) ^i? C5 i?- ( "- 1)/2 (#T 1 )(#T 2 ) 

g€q(Mi,M2):9C2B Tl eTf B ( 9 ) T 2 eT 2 (g) 

for all fi-i,^- 

Fix fii,fi2- For any Ti e Ti, let A(Ti,^i,^ 2 ) denote the integer 
A(Ti,/ii,/i 2 ) := #{<z G q(/ii,/i 2 ) : Ti n i2*g?fe0}, 
and for every dyadic number 1 < Ai < i? 100 ", let Ti[Ai, Hi, /x 2 ] denote the set 

(26) Ti[Ai,/ii,/i 2 ] := {Ti € Ti : Ai < A(Ti, Ml , /i 2 ) < 2X 1 }. 
Since there are only 0(logi?) w 1 values of Ai, it thus suffices to show that 
(27) 

E II E E 0T 1 0T 2 |li 2(9) ^i? C5 i 1 - ( "- 1)/2 (#T 1 )(#T 2 ) 

9 eq( MllM2 ): 9 c2B Tl eTf B ( 9 )nT 1 [A ll( ix 1 ^ 2 ] T 2 eT 2 ( g ) 

for all Ai. (We could also pigeonhole the multiplicity of balls in T 2 in a similar 
manner, but this will turn out to be unnecessary). 

12 By dyadic number we mean an integer power of two. 
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Fix Ai. We still have to prove (27). At this point we pause to recall how the 
analogous argument of Wolff [28] proceeded for the cone (for which the tubes T\ , 
T 2 are constrained to point in null directions). Firstly, by a Plancherel argument 
(similar to an argument of Mockenhaupt [11]), Wolff observed the local estimate 

(28) 

ii Yl Yl faiMl'Ma 

T 1 eTf B (g)nT 1 [A 1 , / . ll([l2 ] T 2 eT 2 ( 9 ) 

^ jR -(n-l)/2 #(T /B WnTi[Ai ^ i)M2])2(#T2W); 

this is basically a consequence of the fact that for fixed T\ , the functions </>Ti 4>t 2 are 
almost orthogonal on q. From (28) it would then suffice to show the combinatorial 
estimate 

(29) Yl #(Tf B ( g )nT 1 [A 1 , m , M2 ]) 2 (#T 2 ( ( z))£i 1 ' C5 (#T 1 )(#T 2 ). 

qeq(fJ.i,^ 2 )-qC2B 

This estimate is true in the case of the cone (see the Remarks section) but does 
not appear to hold for the paraboloid case. To resolve this difficulty we need to 
sharpen the local estimate (28); this is the purpose of the next section. 

7. An improved local estimate 

Before we present our improved version of the local estimate (28), let us begin 
with an informal discussion. Suppose we wish to estimate a quantity of the form 

(30) ii Yl E fc-itoJi?.. 

TieTi t 2 gT 2 

where we shall be careless about exactly what region of spacetime we are integrating 
over. We can expand this expression as 

Y E E E^^'^i'^)- 

TxeTiT.eT.^eTi ^ 

Now if T\ has velocity £i, then the spacetime Fourier transform <pr 1 should be 
supported near the point (£i, — ^|£i| 2 ) in S. Similarly if T 2 has velocity £ 2 , T[ has 
velocity £[, and T 2 has velocity £ 2 . From Parseval's formula, we thus expect the 
above inner product to be very small unless £i +£ 2 is close to !;[ +£ 2 and |£i | 2 + 1£ 2 1 2 
is close to |£ | 2 + |^ | 2 . 

Suppose we fix two of the frequencies, say £i and £[. Then the relation +£ 2 = 
Ci + C 2 w iU correlate £ 2 and ^ 2 , in the sense that either of these two frequencies 
will determine the other. This basic observation is already enough to give a bound 
for (30) which is proportional to (^Ti) 2 (^T 2 ), and by making these ideas slightly 
more rigorous one can soon obtain the bound (28). However, as we will soon see, we 
can do better by also exploiting the additional constraint |£i| 2 + |£ 2 | 2 = |^| 2 + |£ 2 | 2 
to remove one more degree of freedom on the collection T 2 , which will eventually 
make this collection behave sufficiently similar to the collection of tubes in a light 
cone that Wolff's argument will apply. 
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We need some notation. Let Q 2 C R™ denote the spatial frequency regions 



n i: ^ e R":K- ei |<^L} 

tt 2 :-{eeR":|e + ei |< 2^}; 

note these are slightly larger than the spatial frequency supports of Si and 5*2 
respectively. For any £1 £ r2i,6 G Q 2 , let 7r(^i,^ 2 ) ^ ^1 denote the set 



(31) 



■■= {& € Or : a +6 = £ + & 16 1 2 + 161 = ICil 2 + l& 
for some 6 G ^2}; 



£■/ I 2 



one can interpret this set as being equivalent to the set of all parallelograms with 
two vertices in (a slight enlargement of) Si and two vertices in (a slight enlargement 
of) S 2 . 

A little algebra shows that 7r(£i> £2) ^ s contained in the n — 1-dimensional hyper- 
plane in R" which contains 6 and is orthogonal to £ 2 — 6 (cf- the calculations in 
[2], [12], [13], [22]), or in other words 13 



(32) 



(6-6^2-6)R« =0 whenever £ £tt(^,Q. 



Indeed, the points £,1, Cii & , £,' 2 f° rm a rectangle in R™. 

For any ball q <E q(/ii,^t 2 ) and any two frequencies 6 € f2i, £ 2 G ^2, let 
Tf B (q, Ai,/ii,//2,6) £2) denote the collection of those tubes T\ G Tf s (g)nTi[Ai, /ii, ^2] 
such that the velocity i>(Ti) of Ti is within 0{R C5 R- 1 / 2 ) of the set 7r(£i,£ 2 ). Let 
v(q, \i, (i 2 ) denote the quantity 

(33) v(q, Ai,/ii,At 2 ) := sup #Tf B (q, Ai, /xi, M2, 6, £>)■ 

We now prove the following refinement of (28). 
Lemma 7.1. For any q e q(fi\ , /i 2 ) ; we /icwe 

2 



E 



Tx e Tf B (q)n Ti [Ai , M1 , M2 ] T 2 e T 2 (,) 



< 



L 2 (l) 



fi C5 J R-("- 1 ^ 2 K?,A 1 , m , M2 )(#(2f B (g)nT 1 [A 1 , Ml , M 2]))(#T 2 (g)). 

Proof Our arguments here have certain similarities to those in [11], [2], [12], [13], 
[22], and can ultimately be traced back to the L 4 theory of Fefferman, Sjdlin, and 
Cordoba. 

For brevity let us write 

Ti ^Tf^nTMAi./ii,/*,] 
T' 2 :=T 2 (q) 
v := v{q,\ U fi U fi 2 )- 



13 This orthogonality is not absolutely essential to the argument; what is important (particu- 
larly in the proof of Lemma 8.1) is that the set w^i,^) is contained in a hypersurface which is 
transverse to ^ — £1, or indeed to any vector in SI 2 
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Since the L 2 (q) norm is bounded by the global L 2 x norm, it suffices to show that 

(34) || Yl E ^T 1 <AT 2 ||| ? ^ J R C ^- ( "- 1)/2 (#Ti)(#T^) i ,. 

T ie T;r 2 eT; 

Note that a straightforward calculation using PlanchcrePs theorem shows that 
4>Ti4>T2 is globally in L 2 x . Indeed, from (9) we see that the spacetime Fourier 

transform of $>t x is of the form /t x dai, where /t x is supported on a cap {(r, £) € 
5 : £ = w(Ti) + 0(R-^ 2 )} and has magnitude 0(i? n / 4 ). Similarly for cfo. A 
computation using the transversality of S± and S 2 thus shows that the spacetime 
Fourier transform of (f) Tl <f) T2 is supported on the spacetime ball 

(35) B((-^(T 1 )| 2 ,^T 1 )) + (-^(T 2 )| 2 , W (T 2 )),Ci?- 1 / 2 ) 
and has magnitude 0{R 1 / 2 ). In particular we have 

(36) HTM\ L i x <R- {n - 1)/4 - 

We now return to (34). We expand the left-hand side as 

(37) | E (^tAtzAt^t^liJ- 

Ti.Tj'eTi T 2 ,T^T' 2 

From (36) and Cauchy-Schwarz we see that the inner product is 0{R- {n ~^/ 2 ). 
On the other hand, from the Fourier support (35), we see that the inner product 
vanishes unless 

(38) «(Ti) + v(T 2 ) = v(T{) + v(r 2 ) + OiR- 1 ' 2 ) 
and 

K7\)| 2 + \v(T 2 )\ 2 = \v(T{)\ 2 + \v(T>)\ 2 + 0{R~" 2 ). 

In particular, we see (using the separation of fix and f2 2 ) that for fixed Ti, T 2 , 
the velocity v(T{) must lie within 0{R- 1 / 2 ) of the hyperplane •k(v{Ti),v{t()). In 
particular for fixed Ti, T 2 there are at most 0{y) choices for v(T[), and hence 
0{R c5 u) choices of T[ (since by construction of T' 1; T[ must intersect R s q). For 
fixed Ti, T£, T 2 there are at most 0(1) choices of T[ by (38), and hence 0{R CS ) 
choices of T[. Combining all these facts together, we see that we can bound (37) 

by 

(#T;x#T 2 )i? Cd Vi? c5 ir (n - 1)/2 , 

as desired. ■ 

To conclude the proof of Theorem 1.1, it thus remains to prove the combinatorial 
(Kakeya-type) estimate 

Ai , Mi , M2) (#(Tf B (q) n Ti [Ai , mi , /*])) (#T 2 (<?)) 

(39) 9eq(/u,fti2):gc2B 

< i? c5 (#T 1 )(#T 2 ) 

for an appropriate choice of equivalence relation <~ obeying Assumption 5.1. This 
will occupy the next section. 
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8. The combinatorial estimate 



We now prove the combinatorial estimate (39). Interestingly, this estimate is 
of a comparable level of difficulty to the corresponding combinatorial estimate 1 in 
[28], and in particular does not need any additional Kakeya-type information. (The 
numerology is similar to the (n + 2)/2 Kakeya estimate in [26], but the argument 
here seems simpler than the "hairbrush" argument in [26], though of a somewhat 
similar flavor). 

We first need to define the relation <~. For each tube Ti £ Ti[Ai, ^2], let 
B(T\, Ai, /Ui, ^2) be the ball in B which maximizes the quantity 



(40) #{q e q(/xi,M2) : Ti n R S q0; q H B(T U X u Ml , M2 )^0} > iT ca Ai. 



We define the relation <~Ai,^i l( u 2 between tubes in Ti and balls in B by defining 
Ti ~A llMl)M2 B' if Ti e Ti[Ai,'/ii,M2] and £' C 10B(Ti, Ai, m, /z 2 ); note that this 
definition is independent of the ball B which appeared in the previous section. 
Clearly for each tube Ti there are at most 0(1) balls B' such that Ti ~Ai,/u,^ 2 B' ■ 
Then we define Ti <~ _B' if one has Ti ~Ai l( ui l( u 2 ^' f° r some dyadic Ai,/ii,/i2! it is 
then clear that (19) holds for T € TV We then define ~ between T2 and B by a 
completely symmetrical procedure (although we will not need ~ for T2 here as we 
are proving (21) instead of (22)). 

Now we prove (39). By definition of q(^i, ^2), we have 



#{?£ q(Mi,M2) iTxCi? 5 



< Z ^0;gn J B(T 1 ,A 1 ,Mi,M2)^0}- 



From the pigeonhole principle and (26), we observe that 



(41) 



#T 2 (?) 



for all q in (39). Also, by Fubini's theorem and (26), we have 



(42) 

E #(Tf B (g)nT![Ai])< ]T ^Ti^nTilAi]) 



3eq((Ui l( u 2 ):gc2S 9 eq(pi^ 2 ) 



= E q(Mi,M 2 ) tTiHiT-VS} 



TieTi[Ai] 



£ E a, 




Thus to prove (39) it will suffice to show that 



(43) 



u(q , Ai,/ii,// 2 ) ^ i?' 



,C<5# T 2 



A1/Z2 



for all g € q(fi\, ^2) with g C 25. 



14 In our notation, the combinatorial estimate in [28] is essentially (29), but with the tubes Tj 
restricted to light rays. See also the remarks section. 
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It remains to prove (43), which we shall do using a "bush" argument centered 
at <7o- Fix g € q(Mi,M2) with q C 2B, and let £1 £ fii, £ 2 € ^ 2 be arbitrary. Let 
Ti denote the set 

(44) Ti :=Tf B (g ,A 1 ,Mi,M2,6,e2) 
defined in Section 7. By (33), it suffices to show that 

(45) #T' X < R cs f^. 

Let Ti e Ti. By construction, wc have 7\ e Tf B (q ) and T x <= Ti [Ai , m , fj, 2 ] ■ 
In particular, we have T\ n R s qo^$, and B ^ 10B(Ti, Ai, /Ui, ^)- In particular, 
since go C 25, we have 

dist(<zo, 2B(Ti, Ai,/ii, A*2)) £ i?~ c5 i?. 

By (40), we thus have 

#{<Z G q(Mi,M2) : T x n i? 5 ^0;dist( (Zo ,g) £ ir c5 i?} > R- C5 \i. 

On the other hand, by the definition of q(/ii, fJ, 2 ), for each g <G q(/ii, ^2) there are 
j«2 tubes T 2 in T 2 which intersect J? 5 ?. Thus we have 

#{(q,T 2 ) € q(Mi,M2)xT 2 : TtDR^, T 2 nR 5 q^<D; dist(g , ?) £ iT C5 i2} > ir c5 A lM 2- 

Summing over all Ti in Ti, we obtain 

(46) 

#{{q,T u T 2 ) e q x Ti x T 2 : T\ n i? d <z,T 2 n distfcfo, ?) £ > R~ cs X^T^. 

Now we make the following crucial geometric observation, which is analogous to 
the geometric observation used in [28] that a light ray can transversally intersect a 
light cone in at most one point: 

Lemma 8.1. For each T 2 € T 2 , we have 

Ti) e q x Ti : Ti n R s q, T 2 n i? <Mg , ?) £ iT c5 i?} < i? c5 . 

Proof Let (t ,x ) and (t, x) denote the centers of qo and q respectively. Since T\ 
intersects both R 5 qo and R s q, and dist(g r o, q) ~ R~ CS R, we see that 

iT C4 i2 < J* — * | ^ ^ 

and 

x - x = «(Ti)(t - t ) + 0(R cs R 1/2 ). 

On the other hand, since Tj e T' l7 we see from (44) that u(Ti) lies within 0(R° s R~ 1/2 ) 
of7r(^i,^ 2 ). Thus we have 

dist( ^ )7r(6 ,^))<^ jR -l/2. 

On the other hand, if we let e := £ 2 — (,1, then from (32) we see that 
(£-&,e> R » =0for all 
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and hence 

(^- ei ,e) R .<^-V». 

We may rearrange this as 

((t - t , x - x ), e) R », e)) RxR » < R^R 1 ' 2 . 

Thus (t, x) lies within 0(R cs R 1 / 2 ) of the n-dimensional hyperplane II in R x R n_1 
which passes through (to,xo) and which is normal to (— (£1, e) R «, e). But since 
£1 G fii, £ 2 € ^2, we see that e is within l/5n of — 2ei, and — (£i,e) is within l/5n 
of +2. Since ^(T^) is within l/5n of — ei, we thus see that T 2 makes an angle of <~ 1 
with respect to II. Since dist((i, x), T 2 ) ;$ R CS R 1 / 2 , we thus see that (t, a;) is thus 
constrained to lie within a ball of radius R CS R 1 ^ 2 . This means that there are only 
at most 0(R CS ) choices for q. For each fixed q there are at most 0(R CS ) choices 
for Ti, and the claim follows. ■ 

Combining this Lemma with (46) we see that 

R C5 #T 2 > ii- C5 AiM2#T' 1 
and (45) follows. This concludes the proof of Theorem 1.1. ■ 



9. Remarks 

The proof of Theorem 1.1 is very similar to the argument in [28]. Indeed, 
one can compare the arguments as follows. For the cone, the passage 
to localized restriction estimates, wave packet decomposition, induction 
on scales, and fine scale decomposition works almost exactly the same as 
with the parabola, the only major difference being that the tubes are now 
oriented along light rays 15 . For the localized estimate, (28) is used instead 
of Lemma 7.1. This requires us to prove (29). Using (42) and (41) as in 
Section 8, one reduces to showing that 

#(Tf B (q ) n Ti[Ai,/ii,/i2]) £ R c6 f^. 
Arguing as in Section 8, this reduces to showing the estimate 
#{(<?, 7i) e q x Ti : Ti n R 5 q,T 2 n R s q^; dist(go, q) g R~ CS R} £ R CS ■ 

(compare with Lemma 8.1). But this follows in the cone case since the 
tubes Ti which intersect R s q are contained in a i? 1 / 2+5 -neighborhood of a 
light cone; since the tube T2 is concentrated around a light ray, intersects 
Ti transversally and at a distance ^ R~ CS R from the vertex of this light 
cone, the claim then follows from elementary geometry. 



15 Also, the tubes have a more interesting internal structure, being composed of somewhat 
thinner 1 X i? 1 / 2 X R "plates", but this ends up not being very relevant to the argument which 
follows. See [28], [21] for further discussion. 
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• It may well be possible to eliminate much of the pigeonholing in the above 
argument, and perhaps even eradicate the epsilon loss in Theorem 1.1. (See 
for instance [20] for a non-pigeonholed version of the argument for the cone 
in [28], and [21] for the endpoint result). However, it seems difficult to 
access the v parameter without this pigeonholing, and we do not know how 
to remove the epsilons in the paraboloid case. 

• The geometric properties of the paraboloid which were used in the above 
argument (and especially in Lemma 8.1) are easily seen to be robust under 
small perturbations of the paraboloid. In particular, one can easily obtain 
Theorem 1.1 for all disjoint compact subsets of a compact hypersurface of 
elliptic type as defined in [13], [22], providing that the parameter e used to 
define elliptic type is sufficiently small. We sketch this as follows. Let S 
be a surface of elliptic type; after some linear transformations, this means 
that S is of the form 

5:={(r,C)eRxR":r=-i|e| 2 +e/(0} 

where the error function /(£) is smooth, and e is a sufficiently small param- 
eter (depending on the smooth norms of / and on the size and separation 
of Si, 1S2). In other words, S is a small perturbation of the paraboloid 
(2). This means that the dispersion relation v = /i(£) between the group 
velocity v and the frequency £ is not quite the identity (in fact, it is given 
by /i(£) := £ — eV/(£)), but it will still be a homeomorphism and a small 
perturbation of the identity on 5i U S2 if £ is small enough. Aside from 
making this distinction between velocity and frequency, the arguments in 
Sections 3-6 are essentially unchanged. In Section 7, the set 7r(£i,£ 2 ) must 
be replaced by 

H£ e Hi : £1 + 6 = £ + £ 2 , |£i| 2 + I6I 2 = l£i| 2 + l£ 2 | 2 
+2e(/(&) + /(£ 2 ) - /(£) - f(Q) for some £ 2 e 2 }, 

but this is easily seen to be a small smooth perturbation of 7r(£i,£ 2 ). Ac- 
tually, because the dispersion relation v = /i(£) is no longer the identity, 
the relevant set is not 7rs(£i,£ 2 ) but rather h(ns(€i, £2))) but this is still 
a small smooth perturbation of 7r(£i, £ 2 ), and in particular retains the key 
property of lying in a hypersurface transverse to Q2 — f2i . 
Now the remainder of the argument continues as before, with the obvious 
modifications, until we reach Lemma 8.1. Now (t, x) will not lie within 
0(R cs R 1 ^ 2 ) of a hyperplane in spacetime, but instead it will lie within 
0(R cs R 1 / 2 ) of a conic manifold 16 consisting of the union of the lines 
through vertex (t ,xo) which have velocity in h(irs(£i, £ 2 ))- If e is suf- 
ficiently small, this manifold is still transverse to T 2 , and the remainder of 
the argument proceeds as before. 

• Once we have the above bilinear restriction theorems for arbitrary disjoint 
compact subsets of surfaces of elliptic type, we can use the machinery of 
[22] to derive the analogue of Corollary 1.2 for all compact hypersurfaces 



16 In the special case when S is a sphere, then this conic manifold is in fact a circular cone, 
although the aperture and orientation of this cone depends on £1 and £' 2 . 



SHARP BILINEAR RESTRICTION FOR PARABOLOIDS 



21 



of elliptic type. After some finite partitions of unity and some affine linear 
transformations, we may thus obtain Corollary 1.2 for all compact surfaces 
for which all the principal curvatures strictly positive. In particular, the 
restriction conjecture for the sphere S n in R x R n is true for all q > 
2(n + 3)/(n+l). 

• It is also extremely likely that the same argument works when some of the 
principal curvatures are strictly negative; indeed, by combining this argu- 
ment with the argument for the cone, it seems plausible that one should 
be able to obtain good restriction estimates for all surfaces in which at 
most one principal curvature vanishes at any given point. In particular, one 
should be able to obtain bilinear restriction theorems for all non-degenerate 
conic sections when q > 2(n + 3)/(n+ 1) (thus providing a bilinear analogue 
of the linear theory in [18]). If so, this would likely give near-optimal bilin- 
ear LP null form estimates for the wave equation (see [21] for a discussion). 

• It seems likely that these arguments also give some new progress on the 
Bochner-Riesz problem for paraboloids and spheres (see e.g. [3] for a dis- 
cussion), but we have not pursued this question. 
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